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A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1.136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S. C. § 1 33). 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1 )K Responsive to communication(s) filed on 05 February 2004 . 
2a)D This action is FINAL. 2b)^ This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quay/e, 1935 CD. 1 1 , 453 O.G. 213. 

Disposition of Claims 

4) E3 Claim(s) 1-27.32-83 and 86-137 is/are pending in the application. 

4a) Of the above claim(s) is/are withdrawn from consideration. 

5) D Claim(s) is/are allowed. 

6) [3 Claim(s) 1-27,32-83 and 86-137 is/are rejected. 

7) D Claim(s) is/are objected to. 

8) Q Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) ^ The specification is objected to by the Examiner. 

10)^ The drawing(s) filed on 17 October 2001 is/are: a)D accepted or b)K objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1 .85(a). 

Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1 .121(d). 
1 1 )D The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-1 52. 

Priority under 35 U.S.C. § 119 

12)D Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 1 19(a)-(d) or (f). 
a)D All b)D Some * c)D None of: 

1 Certified copies of the priority documents have been received. 

2. Q Certified copies of the priority documents have been received in Application No. . 

3. Q Copies of the certified copies of the priority documents have been received in this National Stage 

application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 
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Art Unit: 2654 Paper #10 

1. Applicant's response to the written restriction requirement (paper #9, mailed 29 Jan 2004) 
was communicated by telephone on 5 February 2004 making an oral election for Group I, with 
traverse. 

Examiner's Amendment 




2. Claim 27 and 59: Delete "pui^nase" from line 2, 

This will remove claims 27 and 59 as linking claims to non-elected groups. 
Delete Claims 28^3^84,^and 

This will delete all non-elected claims. 

3. Claims 1-27, 32-83 and 86-137 are pending. 



4. During a telephone conversation with Richard Bauer on 17 March 2004 a provisional 
election was made with traverse to prosecute the invention of Group I, claims 1-27, 32-83 and 
86-137. Affirmation of this election must be made by applicant in replying to this Office action. 
Claims 28-31, 84, 85 and 138 are withdrawn from further consideration by the examiner, 37 
CFR 1.142(b), as being drawn to a non-elected invention. The applicant authorized the examiner 
to delete this claims as done by examiner's amendment noted above. 

5. Applicant's election with traverse noted above in this Office Action is acknowledged. 
Currently, no ground(s) for traversal were presented and are therefore not found persuasive. 

The requirement is still deemed proper and is therefore made FINAL. 



Abstract 



6. The Abstract of the Disclosure is objected to because the first sentence repeats 
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information given in the title. The first 2 sentences could be deleted. Correction is required. 
See M.P.E.P. § 608.01(b). 



7. The drawings are objected to because the elements of figure 9-13 do not have proper 
reference numerals. Reference numerals for each function need to be placed in the drawings and 
the specification in accordance with 37 CFR 1.84(p)(5). A proposed drawing correction or 
corrected drawings are required in reply to the Office action to avoid abandonment of the 
application. The objection to the drawings will not be held in abeyance. 



8. The disclosure is objected to because of the following informalities: 

The summary of the invention on pages 7-8 (paragraphs 12-14), contain recitations of the 

claims that are contrary to the spirit and intent of CFR § 1 .73: 

"Summary of the invention: A brief summary of the invention indicating its nature and substance, 
which may include a statement of the object of the invention, should precede the detailed description. Such 
summary should, when set forth, be commensurate with the invention as claimed and any object recited 
should be that of the invention as claimed. " 

Permitting cosmetic makeover of claims to be considered part of the disclosure would 
amount to having the claims provide the only specification for themselves. Such invalid circular 
reasoning cannot be allowed. 



Drawings 



Specification 
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For applications originating from foreign patents, the equivalence of a summary can be 
found in sections titled as "Problem Statement", "Proposed Solution", "Approach" & etc. rather 
than the claims. 

Paragraphs 17, 19, 21, 22, 24-29 and 31 on pages 9-12 contain descriptions of the figures 
that should be moved to the detailed description of the invention. 
Appropriate correction is required. 



9. Claims 11, 43, 97-101, 116-120, 126 and 137 are rejected under 35 U.S.C. § 112, second 
paragraph, as being indefinite for failing to particularly point out and distinctly claim the subject 
matter which applicant regards as the invention. 

Claim 11 and 43: The "anti-aliasing filter" is ambiguously claimed. The claim fails to 
indicate where in the process the desired filtering is done. 

Claims 97-101, 116-120, 126 and 137: There is no antecedent in the specification for 
filtering that removes noise and for an error detection algorithm or device. 

10. The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of making 



and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 



11. Claims 97-101, 116-120, 126 and 137 are rejected under 35 U.S.C. 112, first paragraph, 
as failing to comply with the enablement requirement. The claims contain subject matter, which 
was not described in the specification in such a way as to enable one skilled in the art to which it 



Claims 



pertains, or with which it is most nearly connected, to make and/or use the invention. 
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Filtering to remove aliasing effects is taught in the specification but no methods for 
removing various types of undesirable noise are taught or illustrated. 

12. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or 
described as set forth in section 102 of this title, if the differences between the subject 
matter sought to be patented and the prior art are such that the subject matter as a whole 
would have been obvious at the time the invention was made to a person having ordinary 
skill in the art to which said subject matter pertains. Patentability shall not be negatived 
by the manner in which the invention was made. 

13. Claims 1-10, 12-14, 17-27, 32-42, 44-46, 49-72, 75-83 and 86, 87, 89, 92-96, 102-105, 
109-114, 121-125, 128-135 are rejected under 35 U.S.C. § 103 as being unpatentable over Thrift 
(6,188,985) in view of Hoffberg (6,519,564) and Furui (Digital Speech Processing, Synthesis, 
and Recognition). 

As per claim 1, "recognizing free-field audio signals" is taught or suggested by Thrift's 
voice-activated device , figure 1 : 

"hand-held device having a microphone" (his voice- activated control unit 10 - 
which is a compact hand-held unit , col. 2, lines 1-3); 

"a local processor" (his processor lOe) ; 

"extracts a time series of spectrally distinct audio signals features from the 
captured free-field audio signals" (his transmitter lOg in combination with processor lOe , 
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figure 1 which performs all or part of the voice recognition process , col. 3 lines 1-5 where 
Thrift teaches that it is obvious to perform part of the analysis and then transmit the 
partially processed data such as linear predictive coding (LPQ analysis, col. 3, lines 18- 
21 to another computer system for recognition processing - Furui teaches details 
necessary for proper recognition of signals that are omitted from Thrift and Hoffberg on 
pages 231-237); and 

"recognition processor and a recognition memory at the recognition site" (his host 
computer 11 - Furui teaches details necessary for proper recognition of signals that are 
omitted from Thrift and Hoffberg on pages 231-237). 

It is noted that Thrift does not explicitly teach "free-field audio". However, he teaches 
that a microphone is used to capture speech or voice input. Hoffberg teaches that it is well 
known that speech recognition 216 techniques may be used to recognize music style pattern 
recognition 218 , figure 2. The term "free-field audio" is defined in the art as audio that does not 
have any interference from echoes. Capturing audio inside a structure would normally have 
some degree of echo off the walls and would therefore not be "free-field". This term would 
therefore limit the apparatus for recognizing audio to an open space or in a room where walls are 
treated to prevent echoes. It would have been obvious for a person having ordinary skill in the 
pertinent art, at the time the invention was made, that the speech recognition system of Thrift 
which uses a microphone could be applied to receive any audio whether "free-field" or otherwise 
because Hoffberg teaches that speech recognition may be applied to other forms of audio such as 
music and one of ordinary skill in the art would find it obvious that a variety of audio sources 
would include their occurrence in "free-field" environments. 
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It is noted that Thrift and Hoffberg do not teach details for "recognition" based on "a time 
series of spectrally distinct audio signals". However, they clearly indicate that known 
parameters for speech recognition may be used to recognize other audio signals. Furui teaches 
the most common spectral analysis techniques on pages 230-231 which may be used to form 
templates which are compared to real time analysis using the same technique. Known distance 
or correlation measurements are taught on pages 231-237, which yield the parameter that 
indicates which template, is the best match. It would have been obvious to use the parameters 
taught by Furui because Thrift (1997) and Hoffberg (1999) teach that it is necessary to use 
known speech recognition techniques to recognize audio signatures (speech or music, col. 1, 
lines 52-56) and Furui teaches various alternatives that were known in the art before 1989 to 
perform recognition. 

Claims 2, 34, 62, 86, 104, 121, and 122: See claim 1 above. One of ordinary skill in the 
art would find it obvious to employ "analog-to-digital" conversion because the audio input of 
Thrift must be converted to a digital signal in order to be processed by a signal processor (col. 3, 
lines 5-21). It is inherently necessary to perform such a conversion based on the teachings of 
Thrift because if no digital conversion is done, then an analog signal processor would be required 
which is contrary to the frequency analysis of LPC, which is normally done with digital signal 
processors. Furui also assumes that spectral analysis of audio will be done using digital signal 
processing (pages 45-47). 

Claim 3: This is anticipated by the suggested use of LPC extraction as part of a front-end 
process (col. 3, lines 18-22). 



m 
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Claims 4, 21, 25, 36, 64, 79: Thrift shows the host computer connected to the internet 
but Hoffberg shows that the personal computer that audio is input into is a computer with a 
browser with a connection to the internet. Therefore, it would have been obvious that computers 
could be provided with Internet access at various levels because Hoffberg teaches that his 
browser may be connected through the Internet to access another computer (server) that performs 
recognition. 

Claim 5: Thrift clearly teaches that the recognition processor must contain memory with 
audio templates regardless of its location lOf or 1 lb (figure 1). 

Claim 6, 20, 24, 38, 52, 56, 66, 78, 82: Thrift does not require storage of an entire 
audio work before transmitting and by indicating that the control unit may only be a front end 
processor, teaches that it must only start producing parameters capable of transmission. 
Hoffberg similarly teaches that music input does not need to include the whole work but may be 
limited to musical themes (sequence of notes, rhythmic signature, etc). 

Claims 7, 39, 67, 92, 102, 103 and 105: See the musical themes of Hoffberg, which 
would inherently comprise a song. 

Claim 8: As taught by Thrift (noted above in claim 1), it is obvious to perform all or part 
of the recognition on a local or host computer. 

Claim 9: Thrift teaches that the recognition may be performed at the host computer 
where the computations necessary for recognition would take place. 



Claim 10: See claim 1 above. The use of digital conversion is discussed in claim 2 

above. 
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Claims 12 5 44: Official Notice is given that "flash memory" is defined as non volatile 
storage that can be electrically erased in the circuit and reprogrammed. It would be obvious to 
use this type of memory to store any type of data because it has the desirable characteristics of 
being programmable to retain varying data and will not lose data when power is interrupted. 

Claim 13, 45, 71: See the LPC suggested by Thrift (claim 1) represent "different 
frequency bands" and the well known alternatives taught by Furui (pages 230-231) not only 
shows generic frequency band analysis but also teaches the use of psycho acoustically significant 
spectral values known as Bark scale or Mel-scale , page 232. 

Claim 14, 46, 72: The parameters for analysis by Furui and LPC by Thrift are all 
commonly used for compression. Using them for compression is obvious because Thrift teaches 
that LPC used for transmission and compression most commonly performed to reduce the 
bandwidth needed to transmit signals to other locations. 

Claims 17, 49, 75, 110, 123: The use of a "radio receiver" is anticipated by Thrift's 
wireless device (abstract). 

Claim 18: See claim 1 above. The use of "data capturing" is taught by Thrift's 
indication that the unit 10 may capture data that is sent to the host computer 1 1 (col. 3, lines 1-8). 

Claims 19, 26, 27, 40, 51, 58, 59, 68, 77, 83, 87: The ability to provide "a verification 
signal" is taught by Thrift's feedback that comes after recognition of commands , col. 3, lines 57- 
67 which continues to explain feedback from the host system 1 1 may be graphical (Including . . . 
audio), col. 4, lines 2-3. See his data receiver lOh . Further audio feedback is taught in column 4, 
lines 33-67 regarding the ability to recognize questions and supply appropriate answers audibly. 
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Claims 22, 37, 65, 80: See claim 1 above. The ability to store "an entire audio work" is 
taught by Thrift in col. 1, lines 49-52 where he teaches: The idea behind the Web is to collect all 
kinds of data from all kinds of sources, avoiding the problems of incompatibilities by allowing a 
smart server and a smart client program to deal with the format of the data. 

Claims 23, 35, 55, 63, 81: The signal processing details for analysis of a "time series 
waveform" are taught by Furui. Furui teaches the use of spectral distance measures for rank 
ordering the best match. Ordering statistics based on "distinctiveness" is well known for psycho 
acoustically important information as taught by Furui such as the proper spacing of frequencies 
( Bark scale or Mel-scale , page 232 which order the frequencies and bandwidth spacing based on 
the human auditory system) and also on various applications of weighting , pgs. 237-243 which 
help to emphasize important features so they are transformed in a way that improves 
recognition). 

Claims 32, 60: See claim 1 above. "Randomly selecting any one portion of the received 
data stream" is obvious in view of Hoffberg's teaching that A web site thus identified is 
preferably visited a number of times , (col. 3, lines 53-56) for clustering resources with a similar 
profile. Neither particular order for selection of web sites nor any order for visiting a particular 
web site is taught or suggested. Therefore, it would have been obvious to make such a selection 
using any known statistical method for collecting data, which would include a random selection. 

Claims 33, 41, 69: See claim 1 above. Furui teaches correlation techniques. 
"Correlation" could be considered the same as computing a distance (pages 230-243). 
Alternatively, statistical types of correlation calculations include autocorrelation (pages, 231- 
232). 
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Claims 42, 50, 53, 54, 57, 61, 76: See claim 1 above. Performing recognition at a 
"recognition site" is taught by Hoffberg's server 202 connected through the Internet. 

Claim 70 is rejected under similar arguments as applied to claims 1 and 2 above. 

Claim 89, 93-96, 109, 111, 124, 125, 129-134: Providing a "display" is taught by 
Thrift's display 10a , figure 1 and Hoffberg's browser 228 , fig. 2. Hoffberg teaches that the type 
of data to be displayed is in a relational database where the content is indexed by an indexer 220 
in a data base 222 as relating to a certain natural language, one or more music styles , col. 4, 
lines 25-27 and includes bibliographic information items ... for display to the user , col. 5, lines 
20-21. In addition to music other audio data may be streamed as taught by Hoffberg to include 
live radio broadcasts , col. 4, line 39 and news services , col. 4, line 48. Thus, it would obvious to 
include "advertisements" and related information commonly associated with live radio and news 
broadcasts in addition to music related information. 

Claims 112-114, 128, 135: See claim 1 above. A "unique code" is considered the same 
as the templates created for recognition of various sources in the database made up of spectrally 
distinct features. If they were not distinct or unique, then data elements from which the database 
was created could not be distinguished one from another for recognition. The analysis must 
create distinct output for different signals in order for different signals to be distinguished from 
each other. See, for example, the signature of various kinds of music . . . classical music has 
typically a different speech recognition signature than rock music , col. 1, lines 52-56. This what 
may be stored in the database 222 , fig. 2 along with other related data. 
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14. Claims 11, 43, 118, 120, 126, 137 are rejected under 35 U.S.C. §103 as being 
unpatentable over Thrift (6,188,985) in view of Hoffberg (6,519,564) and Furui (Digital Speech 
Processing, Synthesis, and Recognition) as applied to claim 10, 116 and 121 above and further in 
view of Smith (Digital Signal Processing). 

Claim 11, 43: "Anti-aliasing" is defined as removing spurious frequencies from 
waveforms produced by converting digital signals back to analog. Any competent engineer 
would not only be aware of the definition but would know "filtering" techniques to achieve its 
implementation. See, for example, page 48 from Smith which is a treatise on digital signal 
processing showing that an antialias filter is desirable for any data conversion between analog 
and digital formats. 

Claims 118, 120, 126, 137: Using "filtering" to remove noise is a common digital signal 
processing technique. See Smith, pages 368-372. He teaches that it is well known to use a time 
varying Wiener filter on page 369 to remove noise from audio. Therefore, it would have been 
obvious to use a Wiener filter in the digital signal processing required for recognition because 
Smith teaches that reduce undesired wideband noise . 

15. Claims 15, 16, 47, 48, 73, 74, 88, 90, 91, 106-108, 115, 127, 136 are rejected under 35 
U.S.C. § 103 as being unpatentable over Thrift (6,188,985) in view of Hoffberg (6,519,564) and 
Furui (Digital Speech Processing, Synthesis, and Recognition) as applied to claims 10, 42, 70, 
86, 87, 104, 121 above and further in view of Murui (4,959,850) or Grimm (5,907,815). 
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Claims 15, 47, 73, 88, 90, 91, 106, 115, 127, 136: The elements of device 10 (Thrift) 
include common elements to a cellular telephone such as a wireless transmitter lOg , see abstract. 
Therefore, such a device would be within the scope of Thrift because it is well known to affect 
Internet connections using telephones including wireless telephones. Murui and Grimm show 
common cellular telephones, which may include recognition and synthesis (col. 1, line 20 and 
Fig. 4A, Murui) or various voice/data capture capabilities (col. 2, line 20 and Figs. 13-168, 
Grimm). It would have been obvious to modify either Murui or Grimm with the improvements 
of Thrift because Thrift teaches that a hand-held wireless device can be used to capture and pre- 
process audio that can be transmitted for further processing for recognition and to provide greater 
functionality by connecting to the internet. 

Claim 16, 48, 74: The use of a "portable device assistant" is a well known alternative 
taught by Grimm's personal digital assistant "PDA" (col. 2, lines 20-21). 

Claim 107: Using a phone number to establish a connection is inherent in cellular 

phones. 

Claim 108: See claim 1 above for use a microphone to pick up sound. 

16. Claims 97-101, 116, 117, 119 are rejected under 35 U.S.C. § 103 as being unpatentable 
over Thrift (6,188,985) in view of Hoffberg (6,519,564) and Furui (Digital Speech Processing, 
Synthesis, and Recognition) as applied to claim 10 above, further in view of Murui (4,959,850) 
or Grimm (5,907,815) as applied to claims 86, 104 above and further in view of TIA/EIA-627 
(Telecommunications Industry Association standard of June 7 1996) 
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Claims 97-101, 116, 117, 119: For cellular telephones, standards are defined which 
require certain types of noise filtering and error checking. TIA/EIA-627 defines requirements 
for a perceptual noise weighting filter on pages 31-34. To detect errors, TIA/EIA-627 teaches 
the use of a Cyclic Redundancy Check (CRC) on pages 58-60 and 63. Pages 58 shows the CRC 
polynomial generator and page 68 shows how the CRC is actually used to implement masking of 
bad frames that are detected. 



17. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. 

Hopf (4,542,525) and Schulze (4,918,730) are cited to show that it is well known to 
analyze audio for recognition of speech or music. 

Hoffberg (5,963,957) is cited to show that it is well known to create a music database. 

Nilsson (6,049,710), Chuprun (6,385,434), Nilsson (6,405,029) and Mack (6,510,325) 
ares cited to show that various cellular phone devices are well known for combination with other 
types of data collection and processing. 

18. Any response to this action should be mailed to: 

Commissioner of Patents and Trademarks 
Washington, D.C. 20231 

or faxed to: 

TC2600 Fax Center 



Prior Art 
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(703) 872-9314 

Hand-delivered responses should be brought to Crystal Park II, 2121 Crystal Drive, Arlington. 
VA., Sixth Floor (Receptionist). 

19. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to David D. Knepper whose telephone number is (703) 305-9644. 
The examiner can normally be reached on Monday- Thursday 0730-1800. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (703) 305-9645. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 




David D. Knepper 
Primary Examiner 



Art Unit 2654 



